Unsupervised Language Model Adaptation Using Word Classes for Spontaneous Speech Recognition
Abstract
This paper proposes an unsupervised, batch-type, class-based language model adaptation method for spontaneous speech recognition. The word classes are determined automatically on a training set by maximizing the average mutual information between the classes. A class-based language model is built from recognition hypotheses obtained using a general word-based language model, and is then linearly interpolated with that general language model. All the input utterances are re-recognized using the adapted language model. Experiments confirmed that the proposed method improves recognition accuracy in spontaneous presentation recognition. The proposed method was also combined with acoustic model adaptation, and the effects of language model adaptation and acoustic model adaptation were found to be additive. The optimum number of classes is 100 regardless of whether acoustic model adaptation is also applied; under this condition, language model adaptation yields an absolute improvement of approximately 2% in word accuracy.
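The adaptation step described in the abstract can be illustrated with a minimal sketch. It assumes a bigram class model of the form P(w | w_prev) = P(c(w) | c(w_prev)) · P(w | c(w)), estimated by plain relative frequencies from the first-pass hypotheses; the abstract does not state the n-gram order, the smoothing, or the interpolation weight, so the function names, the toy `word2class` mapping, the constant general model, and `lam = 0.5` are all illustrative assumptions rather than the authors' settings.

```python
from collections import Counter


def train_class_bigram(hypotheses, word2class):
    """Estimate an unsmoothed class-based bigram model from recognition hypotheses.

    Assumes every hypothesis word has an entry in word2class (hypothetical mapping).
    """
    class_bigrams = Counter()  # count(c_prev, c)
    class_context = Counter()  # count(c_prev) as a bigram history
    word_counts = Counter()    # count(w)
    class_counts = Counter()   # count(c) over all tokens
    for words in hypotheses:
        classes = [word2class[w] for w in words]
        for w, c in zip(words, classes):
            word_counts[w] += 1
            class_counts[c] += 1
        for c_prev, c in zip(classes, classes[1:]):
            class_bigrams[(c_prev, c)] += 1
            class_context[c_prev] += 1

    def prob(word, prev_word):
        c, c_prev = word2class[word], word2class[prev_word]
        if class_context[c_prev] == 0 or class_counts[c] == 0:
            return 0.0  # no smoothing in this sketch
        p_class = class_bigrams[(c_prev, c)] / class_context[c_prev]
        p_word = word_counts[word] / class_counts[c]
        return p_class * p_word

    return prob


def interpolate(p_adapted, p_general, lam):
    """Linear interpolation: lam * class-based model + (1 - lam) * general model."""
    def prob(word, prev_word):
        return lam * p_adapted(word, prev_word) + (1.0 - lam) * p_general(word, prev_word)
    return prob


# Toy data (assumed): two word classes and two first-pass hypotheses.
word2class = {"the": 0, "a": 0, "model": 1, "speech": 1}
hypotheses = [["the", "model"], ["a", "speech", "model"]]

p_class = train_class_bigram(hypotheses, word2class)
p_general = lambda word, prev_word: 0.25  # stand-in for the general word-based model
p_mixed = interpolate(p_class, p_general, lam=0.5)

print(p_mixed("model", "the"))
```

In a batch-type setup like the one the abstract describes, the interpolated model would then be used to re-recognize all input utterances; the decoder itself is outside the scope of this sketch.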
Similar resources
Improvement of Lecture Speech Recognition by Using Unsupervised Adaptation
The aim of this work is to improve the recognition performance of spontaneous speech. In order to achieve the purpose, the authors of this chapter propose new approaches of unsupervised adaptation for spontaneous speech and evaluate the methods by using diagonal-covariance and full-covariance hidden Markov models. In the adaptation procedure, both methods of language model (LM) adaptation and a...
Unsupervised Language Model Adaptation for Lecture Speech Recognition
This paper addresses speaker adaptation of language model in large vocabulary spontaneous speech recognition. In spontaneous speech, the expression and pronunciation of words vary a lot depending on the speaker and topic. Therefore, we present unsupervised methods of language model adaptation to a specific speaker by (1) making direct use of the initial recognition result for generating an enha...
Combinations of various language model technologies including data expansion and adaptation in spontaneous speech recognition
This paper demonstrates combinations of various language model (LM) technologies simultaneously, not only modeling techniques but also those for training data expansion based on external language resources and unsupervised adaptation for spontaneous speech recognition. Although forming combinations of various LM technologies has been examined, previous works focused on only modeling techniques....
Publication date: 2003